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INTRODUCTION 

One  major  function  of  the  carbohydrates  in  glycoproteins  and  glycolipids  is  to 
present  information  in  the  form  of  three-dimensional  structures.  The  information  in 
oligosaccharide  structures  is  decoded  through  the  binding  of  carbohydrate  recognizing 
proteins  called  lectins.  Involvement  of  lectins  in  diverse  biological  phenomena,  from 
intracellular  routing  of  glycoproteins  to  cell-cell  adhesion  and  phagocytosis,  has  been 
reported  (1).  Especially  significant  is  the  progress  made  in  elucidating  the  role  of 
selectins,  a  subgroup  of  C-type  lectins,  in  inflammation  (2-4).  Under  physiological  flow 
conditions,  leukocytes  interact  with  activated  vascular  endothelium  through  leukocyte 
rolling,  followed  by  firm  adhesion,  spreading,  and  extravasation.  Selectins,  which  bind  to 
oligosaccharide  structures  called  sialyl  Lex  and  sialyl  Lea,  were  shown  to  participate  in 
the  rolling  process.  Since  both  leukocytes  and  tumor  cells,  in  particular  carcinomas  and 
leukemias,  express  these  oligosaccharides  on  their  cell  surface,  it  was  postulated  that 
tumor  cells  might  use  selectin-carbohydrate  interaction  during  metastasis.  Although 
limited,  existing  experimental  data  support  this  hypothesis.  Carbohydrate-protein 
interactions  are  not  restricted  to  selectins.  Other  lectins  may  be  used  by  tumor  cells 
when  interacting  with  carbohydrates,  and  these  lectins  may  also  play  an  important  role 
in  progression  and  metastasis.  A  significant  difference  was  observed  in  the  degree  of 
glycosylation  of  the  mucin,  as  well  as  in  the  N-glycosylation  pattern  of  glycoproteins 
between  normal  and  cancer  breast  cells  (5-8).  Therefore,  to  explore  that  possibility 
breast  cancer  was  chosen  as  the  prime  target  of  our  study. 

The  primary  goal  of  this  proposal  is  to  identify  lectins  that  are  either  produced  by 
breast  cancer  cells  or  that  are  capable  of  binding  to  cell  surface  carbohydrate  structures 
of  the  breast  cancer  cells.  In  order  to  identify  these  lectins  we  chose  to  employ  a 
powerful  technique  known  as  phage  display  (9-12).  Use  of  the  recently  introduced  T7 
phage  system  allowed  us  to  express  proteins  up  to  1200  amino  acids,  which  might  be 
sufficient  to  contain  the  carbohydrate-recognition  domain  (CRD)  of  lectins. 
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BODY 

Task  0.  To  prove  the  feasibility  of  phage  display  strategy  in  cloning  cDNA  encoding 
proteins  with  carbohydrate-binding  property 

From  our  experiments  with  a  HeLa  cell  phage  display  cDNA  library  and  an 
isolated  blood  group  H-specific  glycoprotein  fraction,  we  obtained  results  suggesting  the 
possible  utility  of  the  phage  display  strategy  in  identifying  carbohydrate  binding  proteins 
in  vitro.  By  using  this  glycoprotein  fraction  as  bait,  amplification  from  the  library  of 
multiple  phage  clones  expressing  sequences  (13-16)  from  galectin-3  was  observed. 
Galectin-3  is  a  member  of  S-type  lectins  (17-19),  and  an  affinity  with  the  blood-group 
substance  has  been  reported  (19,20).  Before  expanding  our  research  to  breast  cancer, 
we  proved  that  amplification  of  galectin-3  phage  display  clones  by  in  vitro  biopanning 
was  not  incidental  but  rather  selective.  Southern  hybridization,  as  well  as,  plaque 
hybridization  was  used  to  prove  the  selectivity.  Results  from  this  portion  of  work  have 
been  published  (Yamamoto,  M.,  Kominato,  Y.,  and  Yamamoto,  F.  (1999).  Phage  display 
cDNA  cloning  of  protein  with  carbohydrate  affinity.  Biochem.  Biophys.  Res.  Commu. 
255:  194-199),  and  the  reprint  is  located  in  the  Appendix. 

Task  1.  To  obtain  phage  display  cDNA  clones  which  specifically  bind  to  isolated 
glycoproteins  and  cell  surfaces 

Construction  of  phage  display  cDNA  libraries 

We  have  constructed  a  total  of  11  phage  display  cDNA  libraries  in  T7Select  1-1  vector 
where  eight  of  these  libraries  contained  more  than  one  million  independent  clones.  The 
size  and  titers  of  the  amplified  libraries  were  determined  and  listed  in  Table  1. 

*  In  vitro  biopanning  to  isolate  clones  from  phage  display  cDNA  libraries 

In  addition  to  the  blood  group'glycoprotein  fraction  from  gastric  mucosa,  we  used 
commercially  available  purified  glycoprotein  and  glycoprotein  fractions  for  in  vitro 
biopanning  experiments.  We  also  used  antibodies  against  plant  lectins  as  bait. 

The  list  of  glycolipids,  glycoproteins,  and  antibodies  used  for  in  vitro  biopanning  are  as 
follows. 

Glycoproteins: 

Blood  group  H-expressing  mucin  fraction  from  gastric  mucosa,  al -Acid  Glycoprotein 
from  human  serum,  Fibrinogen  from  human  plasma,  extracellular  matrix  glycoprotein 
fraction  of  Zonae  pellucida  from  porcine  ovary,  and  sialyl  Lewis  x-BSA 

Glycolipids: 

Trihexosylceramide  (CTH),  Asialoganglioside-GMI ,  Asialoganglioside-GM2, 
Monosialoganglioside-GMI,  Monosialoganglioside-GM2,  and  Monosialoganglioside- 
GM3 

Antibodies: 
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Biotinylated  anti-Phaseolus  vulgaris  agglutinin,  biotinylated  anti-Concanavalin  A  (Con 
A),  biotinylated  anti-Peanut  agglutinin  (PNA),  and  biotinylated  anti-wheat  germ 
agglutinin  (WGA) 

MaxiSorp  96-well  plates  were  coated  with  glycoproteins  and  used  for  binding  selections. 
Some  glycolipids  were  coated  onto  96-well  PolySorp  plates  while  other  charged 
glycolipids  were  spotted  onto  a  polyvinylidene  difluoride  (PVDF)  membrane  (21). 
Biopanning  of  antibodies  was  performed  using  streptavidin-coated  magnetic  beads 
(Dynal). 

*  In  vivo  biopanning  to  isolate  clones  from  phage  display  cDNA  libraries 

In  vivo  biopanning  experiments  were  performed  using  EJG  bovine  capillary  endothelial 
cells  and  MCF-7  and  BT-20  human  mammary  carcinoma  cells  as  baits. 

*  In  situ  biopanning  to  isolate  clones  from  phage  display  cDNA  libraries 

We  performed  In  situ  biopanning  using  frozen  breast  tissue  sections  obtained  from  the 
University  of  Michigan  Breast  Cancer  Tissue  Bank. 

Task  2.  To  characterize  the  selected  phage  populations  as  well  as  isolated  individual 
clones 

‘Monitoring  the  enrichment  of  certain  phage  clones  by  PCR  using  lysates  from  selected 
population  of  phages,  determining  nucleotide  sequences  cDNA  inserts,  and  identifying 
the  clones  of  interest 

In  vitro  biopanning 

After  the  fourth  or  fifth  round  of  in  vitro  biopanning,  PCR  amplification  of  cDNA  inserts 
was  performed  and  the  PCR  products  were  analyzed  by  agarose  gel  electrophoresis. 
Several  discrete  bands  were  observed  from  each  selection,  and  the  selections  were 
repeated  using  this  variety  of  glycoproteins,  glycolipids,  and  antibodies  as  bait.  These 
bands  were  excised  from  the  gel,  DNA  was  gel-purified,  and  the  nucleotide  sequences 
of  the  cDNA  inserts  were  determined  directly  or  after  cloning  into  a  plasmid  vector.  The 
nucleotide  sequences  of  153  bands  were  determined  (Table  2).  Homology  search  was 
performed  using  the  BLAST  program.  Unexpectedly,  however,  none  of  the  phages 
contained  the  cDNA  sequences  encoding  the  same  proteins  except  that  two  identical 
clones  encoding  benzodiazapine  receptor  were  obtained  using  two  different  antibodies 
against  plant  lectins  as  bait. 

In  vivo  biopanning 

Different  from  the  above-mentioned  in  vitro  biopanning  experiments,  no  discrete  bands 
were  observed  after  the  sixth  round  of  selection.  Apparently  many  phage  clones  bound 
to  the  cell  surface  (and/or  cell  culture  plate  substratum).  We  next  examined  whether 
phages  could  be  further  screened  based  on  stimulatory/inhibitory  activity  on  cell  growth. 
For  this  purpose,  6x48  clones  (may  not  be  completely  independent)  were  randomly 
selected  from  the  fifth/sixth  rounds  of  EJG  cell  selections  and  infected  to  the  host 
bacteria.  Phage  lysates  were  then  added  onto  the  EJG  cell  culture  in  24-well  dishes  in 
duplicate.  After  72  hours  of  incubation,  cell  growth  was  monitored  by  MTT  assay  (22). 
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Three  clones  each  showing  the  highest  or  the  lowest  values  in  duplicate  MTT 
experiments  were  selected  out  of  the  respective  set  of  48  clones.  Therefore,  a  total  of 
36  clones  were  analyzed  in  the  second  round  of  MTT  assays.  We  anticipated  that  those 
phage  clones  would  exhibit  the  same  tendency  as  observed  in  the  first  round  forming 
two  separate  populations  (one  with  stimulatory  activity  and  the  other  with  inhibitory 
activity  on  EJG  cell  growth).  Disappointing  results  were  obtained.  Some  of  the  clones 
previously  identified  to  enhance  the  cell  growth  in  the  first  round  of  selection  turned  out 
to  repress  the  growth  in  the  second  round,  and  vice  versa.  We  went  on  the  next 
screening  hoping  to  identify  several  true  candidates  out  of  incidentally  selected  clones, 
but  the  third  selection  was  not  successful  either.  Again,  no  separation  between  growth 
stimulatory  clones  and  growth  inhibitory  clones  were  observed.  Results  were  random 
and  not  reproducible.  Since  the  phage  titers  were  approximately  5x1 07  pfu/jJ  and  500  nl 
of  lysate  was  added  to  a  2ml  of  cell  culture,  the  concentrations  of  the  fusion  proteins 
(5x1 07  x  500  molecules/2. 5ml  =  1013  molecules/I  =  1013/6.02x1023  mol/l  =  17  x10'12mol/l 
=  17pM)  may  have  not  been  sufficient  to  induce  growth  changes.  In  order  to  achieve 
higher  concentration  of  fusion  proteins,  transfer  of  the  cDNA  inserts  from  phage  display 
vector  to  a  eukaryotic  expression  vector  may  be  necessary.  We  sequenced  72  phage 
clones  from  the  population  selected  with  EJG  cells  as  bait,  and  they  are  listed  in  Table 
2. 

In  situ  biopanning 

After  the  fifth  round  of  biopanning,  several  discrete  bands  were  observed.  We  plated 
phages  at  low  density  and  PCR-amplify  cDNA  inserts  from  individual  plaques. 
Nucleotide  sequences  of  20  inserts  were  determined.  Except  that  2  sequences  were 
from  the  same  clone,  no  multiple  clones  encoding  the  same  protein  were  observed. 

Task  3.  To  characterize  the  identified  genes  and  their  gene  expression 
‘Performing  Northern  hybridization  and/or  RT-PCR  to  determine  the  mRNA  level  using 
RNA  from  normal  and  tumor  breast  cell  lines,  and  performing  Southern  hybridization 
using  DNA  from  normal  and  tumor  breast  cell  lines 

We  have  not  performed  these  experiments  since  no  true  candidates  were  obtained  from 
Task  2.  We  obtained  many  interesting  candidate  clones,  however,  we  were  unable  to 
obtain  a  multiple  number  of  different  clones  encoding  the  same  protein.  In  addition, 
many  clones  encoded  relatively  abundant  proteins.  These  included  glutaraldehyde  3 
phosphate  dehydrogenase  and  ribosomal  proteins.  Further,  the  directions  and  coding 
frames  of  the  cDNA  sequences  varied.  These  results  suggested  that  these  candidates 
might  have  been  technical  artifacts  rather  than  true  candidates.  Therefore,  we  tested 
that  possibility.  We  used  purified  glycoproteins  as  baits  and  the  previously  selected 
secondary  or  tertiary  round  of  phage  lysates  instead  of  cDNA  libraries.  Complementary 
DNA  libraries  contained  millions  of  different  clones,  whereas  those  lysates  contained 
hundreds  (or  thousands)  of  clones  and  some  clones  were  already  enriched.  Therefore, 
the  same  clones  that  were  enriched  after  the  fourth  or  fifth  round  of  biopanning  starting 
from  library  were  expected  to  enrich  at  the  earlier  round.  The  results  showed  the 
absence  of  selectivity,  suggesting  opportunistic  enrichment.  Then  we  have  changed  the 
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binding  and  washing  conditions  as  well  as  the  method  of  refreshing  phage  libaries  (23) 
without  success. 


KEY  RESEARCH  ACCOMPLISHMENTS 

•  Demonstration  of  the  phage  display  technique  to  clone  cDNA  of  carbohydrate¬ 
binding  protein 


REPORTABLE  OUTCOMES 

•  A  paper  reporting  the  feasibility  of  phage  display  approach  to  clone  cDNA  encoding 
carbohydrate-binding  protein  was  published. 


CONCLUSIONS 

Although  we  could  prove  the  feasibility  of  the  phage  display  technique  to  clone 
carbohydrate-binding  proteins,  we  have  not  succeeded  in  identifying  previously 
unrecognized  lectin  molecules.  In  vitro  and  in  situ  biopannings  produced  many 
candidates,  however,  none  came  from  different  cDNA  clones  encoding  the  same 
proteins  except  for  galectin-3.  Results  from  in  vivo  biopanning  suggested  the  necessity 
of  functional  analysis. 
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APPENDIX 


Table  1.  List  of  phage  display  cDNA  libraries 


MCF-7  Breast  adenocarcinoma 

MCF-7  Breast  adenocarcinoma 

PC-3  prostate  adenocarcinoma 

PC-3  prostate  adenocarcinoma 

HeLa  cell  transfectant 

HeLa  cell  transfectant 

HeLa  cell  transfectant 

EJG  bovine  capillary  endothelial  cell 

Human  liver 

Human  liver 

Human  lymph  node 


1.06x105  clones 
1.26x107  clones 
1.94x1 0s  clones 
1.63x107  clones 
1.77x106  clones 
3.05x10s  clones 
5.29x10s  clones 
1.54x107  clones 
4.68x1 05  clones 
2.55x1 07  clones 
4.63x10s  clones 


4x1 07  pfu/pl 
7.23x1 07  pfu/pU 
4.4x1 07  pfu/pl 
2.80x1 07  pfu/pl 
3.9x1 07  pfu/pl 
3.25x1 07  pfu/pl 
3.44x1 07  pfu/pl 
2.80x1 07  pfu/|il 
5x1 07  pfu/jil 
3.48x1 07  pfu/pl 
5.27x1 07  pfu/pl 


The  number  of  the  independent  clones  and  the  titers  of  the  amplified  libraries  are  shown. 


Table  2.  Sequence  homology  of  the  phage  clones  obtained  by  selections 


(I).  In  vitro  selection 

A.  Glycoproteins 

1.  Blood  group  H-expressing  mucin  fraction 


M 

X89718 

1.9e-16 

Putative  26S  protease  subunit  mRNA 

M 

AA 134606 

l.le-88 

EST 

M 

X98614 

2e-92 

cytokeratin  8  mRNA 

M 

AC002558 

e-1 14 

chr.  17  identical 

M 

AA627181 

e-108 

EST 

M 

X06617 

e-1 28 

ribosomal  protein  S 1 1  mRNA 

M 

AI394640 

e-1 53 

EST 

M 

J05176 

e-113 

alpha  1  -antichymotrypsin  mRNA 

M 

X90826 

e-1 70 

USF2  mRNA 

M 

AA040063 

e-1 62 

EST 

M 

D38112 

6e-32 

mitochondrial  DNA 

MPELiLy 

Y13710 

0.0 

alternative  activated  macrophage-specific  CC  cytokine  mRNA 

MPELiLy 

T89015 

2e-19 

KIAA  0220  mRNA 

MPELiLy 

AA225548 

3e-30 

EST/AIu 

MPELiLy 

S4 16941 

6e-4 

PTMAP2  mRNA 

MPELiLy 

L41066 

7e-53 

NF-AT3  mRNA 

MPELiLy 

U90426 

0.0 

nuclear  RNA  helicase  mRNA 

MPELiLy 

AA83124 

3e-15 

Alu 

MPELiLy 

AI3 15224 

2.2 

MPELiLy 

AA832124 

3e-15 

Alu 

MPELiLy 

Z54072 

e-17 

Alu 

MPELiLy 

AC004383 

0.24 

MPELiLy 

M36682 

e-36 

laminin-binding  protein  (galectin-3)  mRNA 

MPELiLy 

AI253442 

6e-50 

cytochrome  C  mRNA 

MPELiLy 

AC002487 

4.9 

MPELiLy 

AC004896 

3e-12 

Alu 

MPELiLy 

H75348 

2e-25 

MPELiLy 

AA71349 

4e-l  19 

EST 

MPELiLy 

AC00491 1 

2.1 

MPELiLy 

U63090 

3e-67 

alpha  2,3  sialyltransferase  mRNA 

MPELiLy 

W33034 

2e-99 

EST 

MPELiLy 

R76396 

4e-23 

Alu 

MPELiLy 

AA804821 

e-60 

ATP  synthetase  mRNA 

MPELiLy 

AC005224 

3e-46 

Alu 

MPELiLy 

AA38431 1 

e-1 3 

EST 

MPELiLy 

AF042506 

4e-91 

cytochrome  b  mRNA 

MPELiLy 

AA327241 

0.008 
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MPELiLy 

Y00097 

3e-36 

p68  protein  mRNA 

MPELiLy 

AC00310 

6e-38 

Alu 

MPELiLy 

X75362 

0.014 

MPELiLy 

AA687598 

6e-54 

EST 

MPELiLy 

S80305 

7e-29 

beta2  glycoprotein  I  mRNA 

MPELiLy 

U24576 

7e-84 

breast  tumor  autoantigen  mRNA 

MPELiLy 

X02308 

e-136 

thymidylate  synthetase  mRNA 

MPELiLy 

U35451 

0.0 

heterochromatin  protein  p25  mRNA 

MPELiLy 

X93334 

0.0 

mitochondrial  DNA 

MPELiLy 

AC003034 

e-120 

BAC  clone 

MPELiLy 

Z84480 

0.002 

2.  Sialyl  Lex-BSA 

MPELiLy  AF069378.1 

3e-82 

Insulin-like  growth  factor  receptor  II 

MPELiLy 

Z97353.3 

3e-55 

ribosomal  protein  L10  mRNA 

MPELiLy 

L06237 

5e-9 

microtubule-associated  protein  IB  mRNA 

MPELiLy 

NM  001306.1 

4e-60 

Claudin  3  mRNA 

3.  Zonae  pellucida  glycoprotein 

fraction 

MPELiLy 

U93305 

6e-10 

A4  differentiation  dependent  triple  LIM  domain  protein  mRNA 

MPELiLy 

NM  000970.1 

e-112 

ribosomal  protein  L27a  mRNA 

4.  Fibrinogen 
MPELiLy 

U63290 

0.54 

5.  alpha  1  Acid  glycoprotein  fraction 

MPELiLy 

AA996393 

5e-7 

EST 

B.  Glycolipids 
l.CTH 

MPELiLy 

AJ000039 

3e-93 

G3PDH 

MPELiLy 

AF004338 

e-147 

16S  rRNA 

MPELiLy 

Y11395 

0.0 

p40  protein  mRNA 

MPELiLy 

D38524 

e-168 

5’  nucleotidase  mRNA 

MPELiLy 

AJ1 30972 

0.0 

U2  sn  RNP-specific  A  protein  mRNA 

2.  Asialo  GM2 
MPELiLy 

AJ1 30972 

e-96 

U2  sn  RNP  mRNA 

MPELiLy 

ABO  12622 

3e-97 

Tapasin  mRNA 

MPELiLy 

AF055004 

0.0 

KIAA0763  gene  mRNA 

MPELiLy 

X04225 

e-112 

HCC  alpha  I  microglobulin  mRNA 

MPELiLy 

Z84480 

0.002 

3.  Asialo  GM1 
MPELiLy 

X86343 

e-157 

complement  C7  mRNA 

MPELiLy 

ABO  12622 

3e-97 

Tapasim  mRNA 

MPELiLy 

D88984 

e-132 

Ampd  3  gene 

MPELiLy 

S74678 

e-123 

heterogeneous  nuclear  RNP  complex  K  mRNA 

MPELiLy 

M29064 

5e-06 

hnRNP  B 1  protein  mRNA 

MPELiLy 

Z75894 

0.14 

4.  GM3 

MPELiLy 

NM  003191.1 

4e-59 

thr-tRNA  synthetase  mRNA 

MPELiLy 

R78124 

1,7 

MPELiLy 

X85133 

3e-49 

p53  associated  RBQ-1  mRNA 

MPELiLy 

AJ000039 

7e-40 

G3PDH  mRNA 

5.GM2 

MPELiLy 

U96922 

3e-93 

inositol  polyphosphate  4-phosphatase  type  II  mRNA 

MPELiLy 

U68758 

0.0 

pyroline-5-carboxylate  synthetase 

MPELiLy 

AF035429 

2e-97 

cytochrome  oxidase  subunit  I  mRNA 

MPELiLy 

AI093867 

e-144 

HoxAlOmRNA 

MPELiLy 

Y11395 

0.0 

p40  protein  mRNA 

MPELiLy 

U011184 

e-112 

flightless  gene  product  mRNA 

MPELiLy 

X56932 

2e-66 

23 kD  highly  basic  protein  mRNA 

MPELiLy 

AC005303 

0.11 

MPELiLy 

Ml  1233 

e-112 

cathepsin  O  mRNA 

6.GM1 

MPELiLy 

X85786 

0.0 

DNA  binding  regulatory  factor  mRNA 

MPELiLy 

L05095 

0.0 

ribosomal  protein  L30  mRNA 

MPELiLy 

AA977240 

e-176 

EST 
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MPELiLy  D42039  0.0 


C.  Antibodies 

1.  anti-Phaseolus  vulgaris  agglutinin 


H 

NM00714 

e-139 

H 

NM001457 

e-148 

H 

AL022068.2 

0.015 

H 

AL006943.25 

0.072 

H 

NM001527 

4e-26 

H 

AW058612.1 

2e-60 

H 

NC00 1807.2 

e-142 

2.  anti-Concanavalin  A  (Con  A) 

H 

NM001010.1 

e-117 

H 

NM002687. 1 

7e-80 

H 

NM002032. 1 

6e-98 

H 

NM00637L1 

e-132 

H 

Mil  167. 1 

e-106 

H 

NM00 1067.1 

e-156 

H 

AC004829.2 

4.7 

H 

NM002392. 1 

0.0 

H 

G25247 

8e-57 

H 

U89311.1 

e-38 

3.  anti-Peanut  agglutinin  (PNA) 

H 

X92545 

2e-6 

H 

AF078845.1 

0.0 

H 

AL 136369 

0.001 

H 

G25835 

3e-7 

H 

AF151080 

0.0 

H 

NM000985.1 

e-141 

H 

AA480940.1 

6e-88 

H 

NM005566.1 

0.0 

H 

NM005688 

e-155 

MPELiLy 

NM003380 

0.0 

MPELiLy 

AP000694 

5e-92 

MPELiLy 

NM000967 

0.0 

MPELiLy 

NM000062.1 

0.0 

MPELiLy 

NM000090.1 

7e-48 

MPELiLy 

NM002948.1 

e-127 

MPELiLy 

AW665473.1 

1.1 

MPELiLy 

NM007315.1 

e-113 

B 

AJ242682 

3e-66 

B 

AL008639.15 

0.35 

B 

AC009498.3 

9e-82 

B 

NM00101 1 

5e-77 

B 

NM002100 

9e-36 

B 

AW572242. 1  8e-54 

4.  anti-wheat  germ  agglutinin  (WGA) 

H 

ACC005924.2 

3e-84 

H 

AC005606 

2e-52 

H 

NM000714.2 

e-148 

H 

AP001040 

e-131 

H 

ABO  1943  8 

e-101 

H 

NM007104 

e-114 

H 

NM002046 

e-150 

(II).  In  vivo  biopanning 

A.  EJG  cells 

MPH 

U 14970 

3e-113 

MPH 

X73974 

4e-140 

MPH 

X01630 

3e-171 

MPH 

AA722510 

5e-133 

MPH 

S35960 

4e-155 

MPH 

AA427423 

7.1e-65 

KIAA  0081  gene  mRNA 


benzodiazapine  receptor 
anti-binding  protein  278 


histone  deacetylase  2 
EST 

mitochondria 

ribosomal  protein  S6 
desmosome  associated  pirin 
ferritin,  heavy  polypeptide  1 
cartilage  associated  protein 
28S  rRNA 

topoisomerase  II  alpha 

double  minute  2  p53  binding  protein 
EST 

nucleophosmin  phosphoprotein 
vigilin 

16.7kDa  protein 
STS 

HSPC  246  mRNA 
ribosomal  protein  L17 
EST 

lactose  dehydrogenase  A 
ATP  binding  protein 
vimentin 

ribosomal  protein  L3 
complement  component  1  (Cl)  inhibitor 
typelll  collagen 
ribosomal  protein  LI 5 

signal  transducer  and  activator  1  (ST ATI) 
ETAA16  protein 


ribosomal  protein  S7 
hexabrachion  (tenascin  C) 


PAC 

genomic 

benzodiazapine  receptor 
genomic 

Ig  heavy  chain  variable  region 

ribosomal  protein  LI 09 

glutaraldehyde  3  phosphate  dehydrogenaseH 


ribosomal  protein  S5  mRNA 
ribosomal  protein  L4  mRNA 
argininosuccinate  synthetase  mRNA 
Kappa  casein  precursor  mRNA 
laminin  receptor  mRNA 
ribosomal  protein  S16  mRNA 
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MPH 

R51271 

0.18 

MPH 

U57344 

2e-148 

homeobox  protein  Meis3  mRNA 

MPELiLy 

Z28663 

6e-30 

EST 

MPELiLy 

N58329 

7e-40 

EST/Alu 

MPELiLy 

AA523029 

4e-72 

ribosomal  protein  L9  mRNA 

MPELiLy 

H56276 

3e-51 

carbamoyl-phosphate  synthetase  mRNA 

MPELiLy 

W28842 

9e-89 

EST 

MPELiLy 

AC005368 

e-110 

chr.5 

MPELiLy 

M81310 

0.43 

MPELiLy 

X56932 

0.0 

23KD  highly  basic  protein  mRNA 

MPELiLy 

AJ002030 

3e-54 

putative  progesterone  binding  protein  mRNA 

MPELiLy 

AA4 18401 

e-151 

EST 

MPELiLy 

AC005376 

1.7 

MPELiLy 

AA720919 

3e-14 

EST 

MPELiLy 

L29394 

e-164 

haptoglobin  alpha  2FS  beta  mRNA 

MPELiLy 

AC002350 

3e-45 

Alu 

MPELiLy 

AC003100 

2e-15 

MPELiLy 

AA134339 

0.08 

MPELiLy 

AA195932 

0.0 

EST 

MPELiLy 

AA411274 

2e-97 

MPELiLy 

N58329 

9e-42 

EST/Alu 

MPELiLy 

AC002558 

0.0 

EST 

MPELiLy 

AC002300 

0.088 

MPELiLy 

U42386 

e-175 

fibroblast  growth  factor  mRNA 

MPELiLy 

AL031228 

5e-68 

complement  gene 

MPELiLy 

U00947 

2e-52 

(CAC)n/(GTG)n  containing  mRNA 

MPELiLy 

D26488 

4e- 17 

KIAA0007  gene 

MPELiLy 

Ml  0098 

4e-73 

18S  rRNA 

MPELiLy 

H41283 

6e-07 

EST 

MPELiLy 

AB006198 

0.0 

SART-1  mRNA 

MPELiLy 

AA459599 

e-173 

EST 

MPELiLy 

U92980 

e-113 

DT1P1A10  mRNA 

MPELiLy 

X72889 

e-126 

hbrm  mRNA 

MPELiLy 

C04745 

e-147 

EST 

MPELiLy 

M55670 

e-177 

protein  Z  mRNA 

MPELiLy 

X62996 

0.0 

mitochondrial  DNA 

MPELiLy 

AF001892 

5e-78 

MEN  1  region  epsilon/beta  mRNA 

MPELiLy 

AD000813 

3e-6 

chr.19 

MPELiLy 

U66306 

e-121 

retinoid  X  receptor  alpha  mRNA 

MPELiLy 

H87626 

8e-27 

EST 

MPELiLy 

AL009181 

0.002 

PAC 

MPELiLy 

J02625 

e-165 

cytochrome  P-450j 

MPELiLy 

X62153 

e-116 

PI  protein  mRNA 

MPELiLy 

Z33502 

0.007 

MPELiLy 

AF038186 

8e-95 

clone  23914  mRNA 

MPELiLy 

AB000517 

e-141 

CDP-diacylglycerol  synthetase  mRNA 

MPELiLy 

V00654 

2e-47 

mitochondrial  DNA 

MPELiLy 

AA381346 

3e-5 

EST 

MPELiLy 

U65590 

6e-44 

IL-1  receptor  antagonist  IL-IRa 

MPELiLy 

U48705 

0.009 

MPELiLy 

M18981 

e-100 

prolactin  receptor  associated  protein  mRNA 

MPELiLy 

D55674 

e-105 

heterogeneous  nuclear  ribonucleoprotein  D  mRNA 

MPELiLy 

L33264 

8e-35 

CDC2-related  protein  kinase 

MPELiLy 

AA405094 

e-123 

EST 

MPELiLy 

J01415 

0.0 

mitochondrial  DNA 

MPELiLy 

X03559 

0.0 

Fl-ATPase  beta  subunit  mRNA 

MPELiLy 

L06328 

e-166 

voltage-dependent  anion  channel  protein  mRNA 

MPELiLy 

ABO  10066 

e-34 

RBP56/hTAFII68  mRNA 

MPELiLy 

AL022098 

e-100 

chr.6 

MPELiLy 

Z69723 

0.16 

MPELiLy 

AF001542 

2e-53 

EST 

MPELiLy 

AF03694 

0.45 

MPELiLy 

AA964847 

1.5 

MPELiLy 

AC0001 14 

8e-56 

chr.  Xq23 
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MPELiLy 

U85088 

e-8 

coagulation  factor  XI  gene 

MPELiLy 

AC005952 

5e-8 

chr.  19/Alu 

(III).  In  situ  biopanning 

Breast  cancer  tissue  section 

MPELiLy 

X75349 

e-108 

clotting  factor  IX 

MPELiLy 

U62531 

0.0 

AE2  anion  exchanger 

MPELiLy 

NM000994 

0.0 

ribosomal  protein  L32 

MPELiLy 

AC005998 

e-142 

LI  repeat 

MPELiLy 

NM000969 

6e-63 

ribosomal  protein  L5 

MPELiLy 

AI594532.1 

0.03 

MPELiLy 

NM000064 

0.0 

complement  component  3  (C3) 

MPELiLy 

A  WO  14647 

e-94 

MPELiLy 

NM000040 

6e-99 

apolipoprotein  III 

MPELiLy 

R72920 

8e-59 

EST 

MPELiLy 

NM005032 

e-128 

plastin  3 

MPELiLy 

AL034418 

0.0 

glucosamine-6-sulfatase 

MPELiLy 

AF 159295 

e-106 

18S  rRNA 

MPELiLy 

NM000224 

0.0 

cytokeratin  18 

MPELiLy 

NM002215 

0.0 

inter  alpha  globulin  inhibitor 

MPELiLy 

AW  105663 

7e-47 

EST 

MPELiLy 

U712131 

e-133 

glutathione  S-transferase 

Abbreviation  for  the  phage  display  cDNA  libraries  are  as  follows.  M:  MCF-7,  P:  PC-3,  H: 
HeLa  transfectant,  E:  EJG,  Li:  Liver,  Ly:  Lymph  node,  and  B:  breast  tumor.  Except  the  breast 
tissue  (B)  cDNA  library  which  was  constructed  in  T7Select10-3  vector,  all  the  other  libraries 
were  constructed  in  T7Select  1-1  b  vector.  Frame  of  the  codons  and  orientation  of  the  cDNA 
inserts  varied.  Therefore,  it  should  be  noted  that  homology  in  the  nucleotide  sequence  does  not 
necessarily  implicate  homology  in  the  amino  acid  sequence. 
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transferase  transfers  GalNAc  (iV-acetyl-D-galactosamine) 
residue  to  the  substrate  H  by  alpha  l-»3  glycosidic 
linkage,  whereas  B  transferase  transfers  D-galactose. 
We  used  the  A  transferase  as  a  model  enzyme,  and 
examined  the  feasibility  of  cDNA  cloning  of  glycosyl- 
transferases  by  a  novel  technique  employing  phage 
display. 

The  phage  display  technology  is  based  on  the  surface 
expression  of  the  peptide  sequences  fused  with  phage 
capsid  protein.  Until  recently,  most  applications  were 
restricted  to  clone  phage  particles  expressing  variable 
domains  of  antibodies  specific  to  certain  antigens  (23— 
25).  Phage  display  peptide  libraries  made  with  syn¬ 
thetic  oligonucleotides  have  recently  been  utilized  to 
identify  peptide  sequences  that  interact  with  a  variety 
of  bait  ligands,  such  as  proteins,  peptides,  DNAs, 
RNAs,  and  oligonucleotides  (26,  27).  In  addition  to  the 
in  vitro  selections,  their  applications  have  been  ex¬ 
tended  to  in  vivo  selections  (28,  29).  Surface  display 
was  also  used  for  epitope  mapping  (30).  Coat  protein 
gene  III  of  filamentous  phage  has  been  commonly  used 
for  phage  display,  but  other  fusion  vectors  such  as 
flagellin  gene  (fliC)  of  E.  coli  and  capsid  protein  gene 
10  of  T7  phage  have  become  available.  While  the 
C-terminus  of  the  peptide  sequence  is  fused  with  gene 
III  capsid  protein  in  the  filamentous  phage  system 
(N-terminus  of  the  peptide  is  linked  to  pelB  leader 
sequence),  the  N-terminus  of  peptide  sequence  fuses 
with  gene  10  capsid  protein  in  the  T7  system.  With  this 
T7  system,  it  is  now  possible  to  produce  the  phages 
that  display  protein  sequences  up  to  1200  amino  acid 
residues  long. 

We  constructed  a  phage  display  cDNA  library  using 
RNA  from  cells  that  stably  expressed  A  transferase, 
and  performed  biopanning  experiments  using,  as  a  bait 
ligand,  crude  mucin  fraction  containing  blood  group 
H-specific  glycoproteins.  So  far  no  enrichment  of  the 
phages  that  expressed  A  transferase  fusion  protein  has 
been  obtained.  Unexpectedly,  however,  selective  aug¬ 
mentation  was  observed  of  the  phages  that  expressed 
the  fusion  proteins  with  galectin-3,  a  soluble  /3-galac- 
toside-binding  (S-type)  lectin.  Because  of  this  lectin’s 
known  affinity  with  the  blood  group-specific  oligosac¬ 
charides  (13,  14),  these  results  demonstrated,  for  the 
first  time,  that  the  phage  display  was  successful  in 
cloning  cDNAs  encoding  a  protein  with  binding  capac¬ 
ity  to  carbohydrates. 

EXPERIMENTAL  PROCEDURES 

Construction  of  a  phage  display  cDNA  library.  We  constructed  a 
phage  display  cDNA  library  in  bacteriophage  T7Select  System.  HeLa 
cell  transformant  that  stably  expressed  large  amounts  of  A  antigens 
was  used  for  an  RNA  source.  Total  RNA  was  prepared  with  RNA 
STAT-60  reagents  (TEL-TEST,  Friendswood,  TX),  and  poly  A+  RNA 
was  selected  by  oligo-dT  affinity  chromatography.  Directional  cDNA 
library  was  constructed  by  using  Directional  RH  Random  Primer 
cDNA  Synthesis  Kit  and  T7Select  1-lb  Cloning  Kit  (Novagen,  Mad¬ 


ison  WI).  After  in  vitro  packaging,  a  small  aliquot  was  used  to  titrate 
the  library.  The  remaining  library  was  amplified  prior  to  biopanning. 
Amplified  library  was  also  titrated. 

Crude  extraction  of  mucins.  The  human  stomach  mucosa  of  blood 
group  O  secretor  individuals  were  used  to  prepare  crude  mucin 
fraction  that  contained  blood  group  H-specific  glycoproteins  (31). 
Mucosa  was  homogenized,  digested  with  pepsin,  and  ethanol  precip¬ 
itated.  After  phenol  extraction,  soluble  fraction  was  precipitated 
with  10%  ethanol. 

Biopanning  protocol.  The  MaxiSorp  96  well  plate  (Nunc,  Roches¬ 
ter,  NY)  was  coated  for  two  hours  at  room  temperature  with  50  pi  of 
PBS  solution  containing  5  ng,  50  ng,  500  ng,  5  pg,  or  50  pg  of  crude 
mucin.  After  the  blocking  with  PBS  containing  3%  BSA  overnight  at 
room  temperature,  the  wells  were  washed  with  PBS.  Next,  the 
glycoprotein-coated  wells  were  incubated  for  2  hours  with  5.3  X  108 
phages  in  Tris-buffered  saline  (TBS)  supplemented  with  20  mM 
MnCl2  and  0.05%  Tween  20,  followed  by  washing  with  TBS.  Then  the 
overnight  culture  of  BLT  5403  strain  of  E.  coli  was  added,  the  wells 
were  incubated  for  an  hour,  the  contents  were  transferred  into 
L-Broth  supplemented  with  50  pg/ml  carbenicillin  and  additional 
overnight  culture  of  BLT  5403,  and  the  phages  were  grown  until  lysis 
occurred.  Phage  lysate  was  then  diluted.  The  entire  procedures  were 
repeated  two  more  times. 

PCR  amplification  and  DNA  sequencing.  To  determine  the  en¬ 
richment  effect  of  biopanning,  phage  lysates  containing  selected 
populations  of  phages  from  each  round  of  biopanning  were  used  as 
templates  for  PCR  amplification.  T7  UP  and  DOWN  primers  (Nova¬ 
gen)  corresponding  to  the  sequences  in  the  phage  vector  DNA  that 
neighbor  the  inserts  were  used  to  amplify  cDNA  sequences.  PCR  was 
performed  in  20  pi  of  a  reaction  mixture  containing  lx  PCR  buffer 
(10  mM  Tris-HCl  (pH  8.3),  50  mM  KC1,  2.5  mM  MgCl2,  0.001% 
Gelatin),  10%  DMSO,  250  pM  dNTP  mix,  40  pmol  each  of  primers, 
and  1  units  of  Taq  DNA  polymerase  (BRL-Life  Technologies,  Gaith¬ 
ersburg,  MD).  Amplified  fragments  were  analyzed  through  3%  aga¬ 
rose  gel  electrophoresis,  gel  purified  with  GeneClean  II  Kit  (BIO  101, 
Vista,  CA),  and  subjected  to  DNA  sequencing  using  PRISM  Dye 
Terminator  Cycle  Sequencing  Ready  Reaction  Kit  (PE-ABI,  Foster 
City,  CA)  directly  and  after  T-A  cloning  into  the  plasmid  vector, 
pT-Adv  (Clontech,  Palo  Alto,  CA). 

Probe  preparation  and  hybridization.  Plasmid  DNA  containing 
the  galectin-3  cDNA  sequence  was  digested  with  EcoRI.  The  insert 
DNA  was  gel-purified  after  3%  agarose  gel  electrophoresis.  Random 
hexamer  labeling  method  was  employed  to  radiolabel  the  fragment 
using  32P-dCTP  (NEN-Dupont,  Boston,  MA)  and  Random  Primed 
DNA  Labeling  Kit  (Boehringer-Mannheim,  Indianapolis,  IN).  PCR- 
amplified  DNA  from  the  selected  populations  of  phages  analyzed 
through  agarose  gel  electrophoresis  as  above  was  denatured,  neu¬ 
tralized,  Southern-transferred  onto  a  Nylon  membrane,  and  UV 
cross-linked.  Approximately  500  phages  were  plated  from  each  se¬ 
lection,  plaque-transferred  onto  Nylon  membranes  and  processed. 
These  membrane  filters  were  hybridized  overnight  with  the 
galectin-3  cDNA  probe  in  Pipes  buffer  (40  mM  Pipes  (pH  6.4),  1  mM 
EDTA  (pH  8.0).  0.4  M  NaCl)  with  80%  formamide  and  1%  SDS  at 
42°C.  The  filters  were  then  washed  and  exposed  to  the  X-ray  film. 

RESULTS 

We  constructed  a  HeLa  cell  transformant  phage  dis¬ 
play  cDNA  library  in  T7Select  1-lb  vector.  Most,  if  not 
all,  of  the  glycosyltransferases,  including  A  trans¬ 
ferase,  possess  their  catalytic  domains  at  the 
C-terminal  side  (300-550  amino  acid  residues  depend¬ 
ing  on  the  enzymes)  of  the  proteins.  Although  the 
transmembrane  domain  is  important  for  the  Golgi  lo¬ 
calization,  several  experimental  data  suggested  that 
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M  1  2  3  4  5  6  7  8  9  10  11  12  13  14  15  16  17  M 


(B) 


FIG.  1.  PCR  amplification  of  cDNA  inserts  from  phage  popula¬ 
tions  selected  by  biopanning.  (A).  Phage  lysates  were  heat-denatured 
before  being  used  as  templates  for  PCR.  Lysates  used  are  from; 
original  phage  display  HeLa  cell  transformant  cDNA  library  (lane  1), 
phage  populations  obtained  after  one  (lane  2),  two  (lane  3),  and  three 
(lane  4)  rounds  of  selections  with  5  ng,  or  after  one  (lane  5),  two  (lane 
6),  and  three  (lane  7)  rounds  of  selections  with  50  ng,  or  after  one 
yane  8),  two  (lane  9),  and  three  (lane  10)  rounds  of  selections  with 
'500  ng,  or  after  one  (lane  11),  two  (lane  12),  and  three  (lane  13) 
rounds  of  selections  with  5  jug,  or  after  one  (lane  14),  two  (lane  15), 
and  three  (lane  16)  rounds  of  selection  s  with  50  fig  of  blood  group- 
specific  glycoproteins.  Lane  17  shows  the  result  of  PCR  without 
lysate  as  a  negative  control.  M  denotes  100-bp  DNA  ladder  marker. 
(B).  DNA  was  transferred  from  the  above  gel  onto  a  Nylon  membrane 
filter,  and  the  filter  was  hybridized  with  a  32P-radiolabeled  galectin-3 
specific  probe.  A  total  of  five  bands  were  hybridized  with  the  probe. 


this  signal/anchor  domain  and  the  N-terminal  cyto¬ 
plasmic  domain  were  not  required  for  the  enzymatic 
activity  (32).  T7Select  1-lb  display  vector  can  display 
on  the  surface  of  T7  capsid  proteins  up  to  1,200  amino 
acid  residues  in  size.  Target  sequences  are  fused  to  the 
C-  terminus  of  the  10B  capsid  protein.  Therefore,  this 
vector  was  chosen  to  assess  the  feasibility  of  phage 
display  approach  to  clone  glycosyltransferase  cDNAs. 

For  an  RNA  source,  we  used  HeLa  cell  transformant 
that  expressed  A  transferase.  The  library  contained 
5.3  X  106  independent  phage  clones,  and  the  average 
insert  size  was  760  bp.  Because  only  0.1-1  copies  of 
fusion  proteins  are  displayed  on  T7Select  1-lb  phages, 
one  hundred  times  the  size  of  library  (5.3  X  108  phages) 
was  used  for  the  initial  round  of  biopanning  selections 
with  blood  group  glycoproteins,  the  substrate  for  A 
transferase  enzymatic  reaction,  as  a  target. 

We  anticipated  the  enrichment  of  phages  that  dis¬ 
played  the  A  transferase  structure  through  the  rounds 
of  selections.  Results  from  the  plaque  hybridization 
experiments  of  the  selected  populations  of  phages  us¬ 


ing  a  radiolabeled  A  transferase  cDNA  probe,  however, 
showed  no  signs  of  enrichment  (data  not  shown).  Then 
we  examined  whether  certain  populations  of  phages 
might  be  enriched  during  these  biopanning  procedures. 
Using  the  heat-denatured  phage  lysate  as  a  template 
for  PCR,  we  amplified  cDNA  inserts  from  the  selected 
populations  of  phages,  and  observed  enrichment  of  cer¬ 
tain  phages  as  judged  by  the  increased  band  intensity 
through  the  rounds  of  selections. 

Enrichment  was  observed  irrespective  of  the  target 
concentrations  (Fig.  1A).  A  single  major  band  was  am¬ 
plified  from  the  phage  population  obtained  after  three 
rounds  of  selections  with  50  fig  of  the  target  ligand. 
When  5  fig  and  500  ng  of  targets  were  used,  three  and 
two  major  bands  were  amplified,  respectively.  Enrich¬ 
ment  was  also  seen  with  the  phage  populations  se¬ 
lected  with  lesser  concentrations  of  the  targets.  Nucle¬ 
otide  sequences  of  these  PCR-amplified  bands  were 
determined  by  direct  sequencing  as  well  as  after  clon¬ 
ing  into  a  plasmid  vector.  Surprisingly,  several  of  these 
sequences  were  derived  from  the  cDNA  sequence  en¬ 
coding  galectin-3,  a  member  of  carbohydrate-binding 
proteins  known  as  lectin.  Nucleotide  and  the  deduced 
amino  acid  sequences  of  those  bands  from  galectin-3 
cDNA  are  schematically  shown  around  the  fusion  junc¬ 
tion  in  Fig.  2.  Although  the  locations  and  sizes  of  the 
inserts  varied,  their  reading  frames  were  identical  to 
that  of  the  fused  partner  T7  capsid  protein.  Their 
C-terminal  ends  were  conserved. 

Galectin-3,  also  known  as  IgE-binding  protein  (epsilon- 
BP),  carbohydrate-binding  protein  35  (CBP  35),  or  as 
macrophage  cell-surface  protein  Mac-2,  was  originally 
identified  in  rat  basophilic  leukemia  cells  by  virtue  of  its 


Phages 

298 

clone  1 .  G  AAT  TCA  AGC  GCC  TAC  CCT  GCC  ACT  GGC  CCC  TAT  . . . 
100 

Ala  Tyr  Pro  Ala  Thr  Gly  Pro  Tyr  . . . 

clone  2.  G  AAT  TCA  AGC  GCC  AGC  AGC  CGT  CCG  GAG  CCA  GCC  AAC 
1 

GAG  CGG  AAA  ATG  GCA  GAC  AAT  TTT  TCG  CTC  CAT  .  .  . 

1 

Met  Ala  Asp  Asn  Phe  Ser  Leu  His  . . . 
232 

clone  3 .  G  AAT  TCA  AGC  GTC  TAC  CCA  GGG  CCA  CCC  AGC  GGC 
78 

Val  Tyr  Pro  Gly  Pro  Pro  Ser  Gly  . . . 

-  346 

clone  4 .  G AAT  TCA  AGC  GTG  CCT  TAT  AAC  CTG  CCT  TTG  CCT  . . . 
116 

Val  Pro  Tyr  Asn  Leu  Pro  Leu  Pro  . . . 

FIG.  2.  Nucleotide  and  the  deduced  amino  acid  sequences  of  phage 
clones  containing  galectin-3  cDNA  sequence.  Nucleotide  and  the  de¬ 
duced  amino  acid  sequences  from  the  four  phage  clones  that  expressed 
galectin-3  fusion  proteins  are  schematically  shown  around  the  Eco  RI 
fusion  junctions  with  gene  10  B  in  the  T7Select  1-lb  Vector.  The  A 
residue  of  initiation  codon  and  the  initiation  codon  are  numbered  1  in 
the  nucleotide  and  the  deduced  amino  acid  sequences,  respectively.  The 
sequences  derived  from  the  Eco  RI  linker  are  underlined. 
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Panning  Target 

5  ng  50  ng  500  ng  5  |ig  50  ng 


FIG.  3.  Hybridization  of  phage  plaques  with  a  galectin-3  probe.  Phages  from  selections  were  plated  at  approximately  500  plaques  per 
plate.  After  plaque-transfer  onto  Nylon  membrane,  the  membranes  were  hybridized  with  the  galectin-3  probe. 


affinity  with  IgE  (33).  Human  galectin-3  possesses  an 
open  reading  frame  of  750  base  pairs  encoding  a  250- 
amino-acid  protein.  The  N-terminal  41  amino  acids  are 
homologous  to  a  serum  response  transcription  factor,  and 
the  adjacent  sequence  (aa  42-106)  contains  the  repetitive 
proline- glycine-rich  motif.  The  C-terminal  domain  con¬ 
tains  CRD,  a  sequence  motif  shared  by  other  S-type  lec¬ 
tins.  This  domain  is  responsible  for  the  binding,  and  is 
well  conserved  among  different  species  of  mammals  (34). 
Although  all  S-type  lectins  bind  /3-galactoside,  the  speci¬ 
ficity  of  each  lectin  seems  to  be  unique  (13,  35).  Our 
results  confirmed  the  ability  of  galectin-3  to  bind  the 
blood  group  structures  (13,  14). 

We  examined  the  selective  enrichment  of  phages 
expressing  galectin-3  fusion  proteins  during  biopan¬ 
ning  by  two  methods;  the  Southern  hybridization  and 
the  plaque  hybridization.  DNA  was  transferred  from 
the  agarose  gel,  shown  in  Fig.  1A,  onto  a  Nylon  mem¬ 
brane,  and  the  membrane  was  hybridized  with  a  radio- 
labeled  galectin-3  cDNA  probe.  The  result  is  shown  in 
Fig.  IB.  Five  bands  (one,  three,  and  one  from  selections 
with  50  /xg,  5  jag,  and  500  ng  of  target  ligand,  respec¬ 


tively)  were  hybridized  with  the  probe.  Lysates,  con¬ 
taining  selected  populations  of  phages,  were  plated  at  a 
density  of  approximately  500  phages  per  plate.  DNA 
was  transferred  onto  Nylon  membranes,  and  these 
membranes  were  used  for  the  plaque  hybridization 
with  the  galectin-3  probe.  Results  from  the  plaque  hy¬ 
bridization  experiments  are  shown  in  Fig.  3.  The  num¬ 
bers  of  the  hybridized  plaques  were  counted,  enrich¬ 
ment  factors  were  calculated,  and  the  results  are 
shown  in  Table  1.  These  two  hybridization  experiments 
clearly  demonstrated  the  selective  amplification  of 
phages  that  expressed  galectin-3  structures.  The  re¬ 
sults  also  showed  that  the  selection  was  target  concen¬ 
tration  dependent.  Although  the  enrichment  was  ob¬ 
served  even  when  the  bait  ligand  was  used  in  lower 
concentrations,  the  sequences  from  the  enriched 
phages  were  not  related  to  the  galectin-3  sequence. 

DISCUSSION 

We  obtained  the  results  that  demonstrate  the  feasi¬ 
bility  of  phage  display  technology  for  the  purposes  of 
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TABLE  1 

Enrichment  of  Phages  That  Express 
GaIectin-3  Fusion  Proteins 


Target  ligand 
amount 

Round  of 
selections 

No.  of 
total 
plaques 

No.  of 
galectin-3 
positive 
plaques 

Enrichment 

Original  library 

None 

8122 

1 

1 

5  ng 

1st 

412 

0 

— 

2nd 

642 

0 

— 

3rd 

602 

0 

— 

50  ng 

1st 

502 

0 

— 

2nd 

519 

0 

— 

3rd 

493 

0 

— 

500  ng 

1st 

301 

0 

— 

2nd 

391 

2 

41.6 

3rd 

625 

247 

3210.4 

5  nS 

1st 

610 

20 

266.3 

2nd 

650 

267 

3336.9 

3rd 

610 

525 

6991.5 

50  ng 

1st 

374 

16 

347.5 

2nd 

312 

83 

2161.1 

3rd 

536 

359 

5440.9 

Note.  Enrichment  of  phages  expressing  galectin-3  fusion  protein 
was  examined  by  plaque  hybridization  of  phages  from  each  round  of 
selection. 


detecting  carbohydrate-protein  interaction  and  cloning 
phage  particles  displaying  fusion  proteins  with  carbo¬ 
hydrate  affinity.  The  experiments  were  originally  in¬ 
tended  to  evaluate  the  capability  of  cloning  cDNAs 
encoding  glycosyltransferases  using  this  novel  technol¬ 
ogy.  We  examined  whether  the  phages  that  expressed 
A  transferase  fusion  protein  could  be  enriched  by  serial 
rounds  of  biopannings  to  the  target  containing  blood 
group  H-specific  oligosaccharide  structures,  the  sub¬ 
strates  for  this  enzyme.  Rather  than  A  transferase,  we 
observed  enrichment  of  phages  that  expressed 
galectin-3  fusion  proteins  after  the  selections  with  rel¬ 
atively  higher  concentrations  of  target.  This  enrich¬ 
ment  was  specific,  without  doubt,  because  multiple 
number  of  phage  clones  containing  different  portions  of 
the  galectin-3  were  obtained.  Enrichment  was  also  ob¬ 
served  with  lower  concentrations  of  target,  however,  it 
remains  to  be  resolved  whether  or  not  the  enrichment 
was  meaningful  because  multiple  numbers  of  clones 
containing  different  portions  of  the  identical  sequences 
were  not  obtained  in  these  cases. 

Enrichment  of  phages  that  express  galectin-3  was 
unexpected.  However,  considering  the  reported  strong 
affinity  of  certain  S-type  lectins  to  the  blood  group 
structures,  this  may  not  be  astonishing.  Nonetheless, 
the  phage  display  cDNA  cloning  was  shown  to  be  suc¬ 
cessful  in  targeting  at  the  proteins  with  carbohydrate 
affinity.  The  same  approach  may  be  useful  to  clone 
cDNAs  that  encode  other  carbohydrate-binding  pro¬ 
teins,  using  the  appropriate  target  ligands  for  selec¬ 


tions.  If  the  binding/washing  conditions  are  optimized, 
cDNA  cloning  of  proteins  with  lower  affinity  may  be 
possible.  Various  modifications  of  conditions  may  be 
workable.  The  temperature  for  phage  binding  can  be 
changed.  Concentration  of  calcium  and  other  cations 
may  be  critical  for  those  proteins  that  depend  on  these 
ions  for  binding.  Phage  display  system  allows  both 
evaluation  of  the  presence  of  other  types  of  lectins  and 
the  determination  of  their  identity.  Different  from  gly¬ 
cosyltransferases,  however,  some  lectins,  such  as  selec- 
tins,  possess  CRD  at  the  N-terminus.  Therefore,  use  of 
two  complementary  vector  systems  that  permit  the 
fusion  at  both  the  N-  and  the  C-terminus  may  become 
necessary. 

Selection  of  glycosyltransferase  cDNAs  by  phage  dis¬ 
play  approach  seems  to  be  difficult,  even  if  not  impos¬ 
sible,  especially  when  competitive  phages  that  exhibit 
fusion  proteins,  such  as  lectins,  with  stronger  binding 
properties  to  the  same  target  are  present  in  the  library. 
The  messages  for  lectins  are  generally  more  abundant 
than  those  for  glycosyltransferases.  Therefore,  the 
source  of  RNA  for  the  library  construction  may  be 
crucial.  Human  A  transferase  was  previously  purified 
from  lungs  to  the  homogeneity  through  several  steps  of 
procedures  including  Triton  X-100  extraction,  Sepha- 
rose  4B  (UDP)  chromatography,  cation-exchange  chro¬ 
matography,  and  reverse  phase  chromatography  (36). 
Because  we  were  unsure  whether  the  phages  would 
tolerate  these  procedures  and  because  we  had  crude 
mucin  fraction  with  blood  group  H  activity,  we  took  a 
shortcut  approach  by  utilizing  the  affinity  of  the  en¬ 
zyme  to  the  substrate.  Chromatographic  steps  similar 
to  some  of  those  used  in  enzyme  purification  may  be 
applied  to  isolate  the  phages  displaying  A  transferase. 
Use  of  high  copy  number  display  T7Select  415b  Vector 
may  increase  the  valence  and  enhance  the  phage’s 
affinity  with  the  ligand.  This  vector,  however,  can  only 
incorporate  a  maximum  of  39  amino  acids,  which  may 
be  insufficient  to  code  for  CRD  when  taking  into  ac¬ 
count  the  general  belief  that  about  100  amino  acid 
residues  are  necessary  to  form  a  self-contained  sugar¬ 
binding  site  (12).  Future  success  of  cloning  glycosyl¬ 
transferase  cDNAs  by  phage  display,  therefore,  may 
heavily  depend  on  the  development  of  new  vectors  that 
display  multiple  copies  of  long  protein  sequences. 

Lectins  are  involved  in  pathological  as  well  as  phys¬ 
iological  phenomena,  and  may  also  play  an  important 
role  in  progression  and  metastasis  of  cancer.  We  have 
shown  that  this  novel  approach  of  phage  display  is 
useful  in  identifying  proteins  with  carbohydrate¬ 
binding  properties.  Using  this  technique,  new  types  of 
lectins,  if  any,  may  be  identified,  CRD  epitopes  may  be 
mapped,  and  our  understanding  of  the  structures  and 
functions  of  these  molecules  involved  in  carbohydrate- 
protein  interactions  may  be  greatly  advanced. 
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